|
|
Accession Number |
TCMCG064C17478 |
gbkey |
CDS |
Protein Id |
XP_011083872.1 |
Location |
complement(join(4921001..4921264,4921616..4921750,4921826..4922426,4922804..4923063,4923143..4923480,4923619..4923955,4925209..4925499,4925936..4926142)) |
Gene |
LOC105166266 |
GeneID |
105166266 |
Organism |
Sesamum indicum |
|
|
Length |
810aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA268358 |
db_source |
XM_011085570.2
|
Definition |
THO complex subunit 5B isoform X1 [Sesamum indicum] |
CDS: ATGGAAGTGACGATGGCGGAGCCCGGTGAAATACTGCCGGAGCGCAATGTAGACATGGCGGCGCTCTACGACATGCTGCGGTCGAGCAAGGCTTCGGCGGAAGAAATCGTGGCCAAGATGTTGGCCATCAAGAAAGAATCTCAGCCCAAATCCCAGCTCCGGGAACTTGTCACTAGGATTCTCCTCAATTTCGTCACCCTACGTCAGGCAAATAGGTCTATCTTGCTTGAAGAAGACAGAGTAAAAGCAGATACTGAACGCGCTAAAGCACCTGTGGACCTCACAACCTTGCAGCTCCATAATTTGATGTATGAGAAAAATCACTATGTTAAAGCAATAAAAGCTTGCAAAGACTTTAAGACTAAATATCCTGATATTGAACTTGTACCCGAGGAAGAATTCCTCAGAGATGCCCCAGAAGACATTAAAAGCTCCACATTATCAACTGACAGTGCGCATGATTTGATGCTTAAAAGGCTCAACTATGAGCTTTTCCAGCGCAAAGAATTATGCAAGCTTCGTGATAAGTTGGAACTACAAAGAAAAGCTCTCGAAGAGACAATTGCTAACAGGAAAAAGTTCTTATCAAGTCTCCCTTCACACCTCAAAGCTCTCAAAAAGGCATCCTTGCCTGTGCAACATCAGTTGGGGCTTCTGCATACCAAGAAACTAAAGCAGCAGCAATTAGCAGAGTTGCTCCCACCTCCTCTCTACATAATCTACTCTCAGTTACTTGCTCAGAAGGAAGCGTTTGGAGAGAATATTGAACTGGAGATTGCAGGAAGTGTAAAGGATGCACAGGCTTTTGCGCGCCAGCTTGCAAATAAGGACTCTGCTATATTAACAAACTTAGAGAATTCCAAGTTGGAAGATGATGTGCCTGATGAGGAAGACGATGGTCAAAGGAGGAGAAAGCGGCCAAAGAAGGTTCTAAGCAAGGATAACCATGACCAGTCTGGAATATATCAAAGTCATCCTCTTAAAGTTTCCCTCCACATAAGTGATGATGAAGCTTCGGACTTGAACTCAGCAAAACTCATCTCCTTGAAGTTTGAGTTCTTAATAAAGTTGAATGTTGTGTGTGTAGGAGTAGAAGGCTCTGAAGAAGATCCTCAAAACAATATCTTGTGCAACTTATTTCCTGATGACACTGGCCTTGAGCTCCCTCTGCAGTCAGCAAAGCTCTGGATTGGCAATTCTTTTTCATTTGATGATAGGCGAACTTCACGGCCTTACAAATGGGTCCAGCATTTGGCAGGAATTGATGTCTTGCCAGAGGTTTCGCCACTGATTTCAGTCTCTGGAGACTCTAATAGTGAGACTACTAGACACGGTTCTGTTCTGTCAGGTCTGTCATTATATCGTCAGCAGAACAGAGTGCAGACAGTTGTGCAAAGGATTTGTGCTCGTAAAAAGGCTCAGCTGGCTCTTGTGGAGTTACTTGATTCGCTAAGGAAGCTTACTTGGCCTACTTTTACCTGTGAAAGCGTTCCATGGGCTTCATACACTCCACACTGCAATTTGCATGGCTGGCTATCCATGACTTCAGCTGGTAACAGTACTACATCTCTGCCACTGGTTGATGCAGAACAGAGTCAGGGTCCTACAAGTGTCAATGCAGATAGAAACTCTGGTAGGTCCAAGGAGATGGAGACCACAACAGAAGATGGGGAGCTTCCATCTTTGGTTCCAGTTGCTAATGGTGTAAATGATGTTGGACTCACCCCCACAAAAGGATCTGAACTTGAGAATTCCAGAAGGCTGAGTTTGATTTCAAAAAGTATCATGTCCCCAATCAACAAGGGGAAGTCACCAAGTTTTAAGAAGCTTGAGGAGGATGTTGATCTCATGCTGGAATCTGATAATGAGCTTGATGAACCAGTTAAAGTGGAGGAAACATCTGATAATGCATCACCATTGGGAGAACTAGCATTTGTTGACAATTCATGGGCGGACTGTGGGGTTCAAGAATACAGTCTTGTACTAACTCGTAGGTTGGACAATGATGACAGGATTATGAAATTGGAAGCCAAGATCAAAATAAGCACAGAATATCCTCTTAGGCCTCCTCATTTTGGACTGAGTCTTTATAGTTCCTCACAAGGAGAGAACTACTTCGTGTCTAATGGTTCGAGGTGGTACAATGAACTTCGTGCAATGGAGGCAGAGGTCAATGTTCACATAATAAGGATGATACCGTTCGATCAAGAAAATTTAATTCTAGGTCATCAAGTGCTTTGCCTTGCGATGCTGTTTGACTTCTTCGTGGATGATGGGAATCCTTCTGAGAAGCGAAGGTCTACTTCAGTGATTGATGTTGGTTTATGCAAGCCTGTAAGTGGAAGGCTTGTCAGCCGATCTTTTAGAGGTCGGGATCGTAGGAAAATGATTTCATGGAAAGACAACACCTGCACTCCTGGTTATCCTTACTAG |
Protein: MEVTMAEPGEILPERNVDMAALYDMLRSSKASAEEIVAKMLAIKKESQPKSQLRELVTRILLNFVTLRQANRSILLEEDRVKADTERAKAPVDLTTLQLHNLMYEKNHYVKAIKACKDFKTKYPDIELVPEEEFLRDAPEDIKSSTLSTDSAHDLMLKRLNYELFQRKELCKLRDKLELQRKALEETIANRKKFLSSLPSHLKALKKASLPVQHQLGLLHTKKLKQQQLAELLPPPLYIIYSQLLAQKEAFGENIELEIAGSVKDAQAFARQLANKDSAILTNLENSKLEDDVPDEEDDGQRRRKRPKKVLSKDNHDQSGIYQSHPLKVSLHISDDEASDLNSAKLISLKFEFLIKLNVVCVGVEGSEEDPQNNILCNLFPDDTGLELPLQSAKLWIGNSFSFDDRRTSRPYKWVQHLAGIDVLPEVSPLISVSGDSNSETTRHGSVLSGLSLYRQQNRVQTVVQRICARKKAQLALVELLDSLRKLTWPTFTCESVPWASYTPHCNLHGWLSMTSAGNSTTSLPLVDAEQSQGPTSVNADRNSGRSKEMETTTEDGELPSLVPVANGVNDVGLTPTKGSELENSRRLSLISKSIMSPINKGKSPSFKKLEEDVDLMLESDNELDEPVKVEETSDNASPLGELAFVDNSWADCGVQEYSLVLTRRLDNDDRIMKLEAKIKISTEYPLRPPHFGLSLYSSSQGENYFVSNGSRWYNELRAMEAEVNVHIIRMIPFDQENLILGHQVLCLAMLFDFFVDDGNPSEKRRSTSVIDVGLCKPVSGRLVSRSFRGRDRRKMISWKDNTCTPGYPY |